Data Mining the NCI60 to Predict Generalized Cytotoxicity
Abstract
Eliminating cytotoxic compounds in the early and later stages of drug discovery can help reduce the costs of research and development. By applying principal components analysis (PCA) to the NCI60 data, we demonstrate that approximately 89% of the total log GI50 variance is due to the nonspecific cytotoxic nature of substances. PCA also led to the identification of groups of structurally unrelated substances showing very specific toxicity profiles, such as a set of 45 substances toxic only to the Leukemia_SR cancer cell line. To predict nonspecific cytotoxicity from the mean log GI50, we built a decision tree using MACCS keys that correctly classifies over 83% of the substances as cytotoxic or noncytotoxic in silico, using a cutoff of mean log GI50 = -5.0. Finally, we established a linear least-squares model in which nine of the 59 available NCI60 cancer cell lines are used to predict the mean log GI50. The model has R² = 0.99 and a root-mean-square error (RMSE) of 0.09 between the observed and calculated mean log GI50. Our predictive models can be applied to flag generally cytotoxic molecules in virtual and real chemical libraries, thus saving time and effort.
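The PCA and least-squares steps described in the abstract can be sketched roughly as follows. This is a minimal illustration on synthetic data, not the authors' pipeline: the matrix `log_gi50`, the noise levels, the choice of the first nine columns as predictor cell lines, and the reading of the -5.0 cutoff as "mean log GI50 at or below -5.0 is cytotoxic" are all assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for an NCI60-style matrix (assumption, not real data):
# rows are substances, columns are 59 cell lines, values are log GI50.
n_substances, n_lines = 500, 59
general = rng.normal(-5.0, 1.0, size=(n_substances, 1))         # shared, nonspecific potency
specific = rng.normal(0.0, 0.3, size=(n_substances, n_lines))   # cell-line-specific deviation
log_gi50 = general + specific

# PCA via SVD of the column-centered matrix; squared singular values are
# proportional to the variance carried by each principal component.
centered = log_gi50 - log_gi50.mean(axis=0)
singular_values = np.linalg.svd(centered, compute_uv=False)
explained = singular_values**2 / np.sum(singular_values**2)

# Least-squares model: predict the mean log GI50 over all cell lines
# from nine of them (here simply the first nine, a hypothetical choice).
mean_log_gi50 = log_gi50.mean(axis=1)
subset = np.arange(9)
X = np.column_stack([np.ones(n_substances), log_gi50[:, subset]])
coef, *_ = np.linalg.lstsq(X, mean_log_gi50, rcond=None)
predicted = X @ coef

residual = mean_log_gi50 - predicted
rmse = np.sqrt(np.mean(residual**2))
r2 = 1.0 - np.sum(residual**2) / np.sum((mean_log_gi50 - mean_log_gi50.mean())**2)

# Flag substances using the -5.0 cutoff (direction of the cutoff is an assumption).
is_cytotoxic = predicted <= -5.0

print(f"variance explained by PC1: {explained[0]:.2f}")
print(f"R^2 = {r2:.3f}, RMSE = {rmse:.3f}")
print(f"flagged as cytotoxic: {is_cytotoxic.sum()} of {n_substances}")
```

When a single shared factor dominates the data, as in this synthetic matrix, the first principal component carries most of the variance and a handful of cell lines is enough to recover the mean closely, which is the pattern the abstract reports for the real NCI60 data.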
Journal: Journal of Chemical Information and Modeling
Volume 48, Issue 7
Pages: -
Publication date: 2008